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DESCRIPTION OF THE INVENTION 

Field of the Invention 

This invention relates generally to pattern recognition. More particularly, it 
relates to apparatus and methods for pattern recognition based on transform 
5 aggregation. 

Background of the Invention 

Pattern recognition has been studied for over three decades. However, recent 
advances have allowed pattern recognition to become widely used in various 
applications, such as, face recognition systems, character/handwriting recognition 

10 systems, biometric recognition systems, video surveillance systems, etc. 

Fig. 1a shows a conventional pattern recognition system for recognizing faces. 
As shown in Fig. 1a, a camera 102 takes an image of scene 100 and provides an image 
signal to a recognizer 104. Typically, recognizer 104 includes a face detector module 
106 and a face matcher 108. 

15 Face detector module 104 detects a portion of the image signal (i.e., an image of 

scene 100) that is relevant for matching. For example, as illustrated in Fig. 1b, face 
detector module 106 may detect a face of a person in an image portion 130 of scene 
100. After detecting image portion 130, face detector module 104 then segments image 
portion 130 from the image signal. For example, as illustrated in Fig. 1c, face detector 

20 module 106 may isolate image portion 130, e.g., the person's face, to form an image 
segment 140. 



1 



Attorney Docket No.: 06720.0069 

Face matcher 108 receives image segment 140 from face detector module 106. 
Face matcher 108 includes a feature extractor module 1 10 and a matching module 112. 
Feature extractor module 110 extracts any relevant features identified in image segment 
140. For example, as illustrated in Fig. 1d, feature extractor module extracts features 
5 142, 144, 146, and 148, e.g., location of the eyes, distance between the eyes, location 
of the nose, and location of the mouth, from image segment 140. 

Matching module 112 searches a memory 1 14 to find a stored pattern (not 
shown), which matches image segment 140 based on features 142, 144, 146, and 148. 
Matching module 1 12 typically makes a decision as to which stored pattern or patterns 
10 match image segment 140 according to a predetermined decision rule. 

Output module 116 receives the decision from matching module 112 and outputs 
the decision to a user. For example, as shown in Fig. 1a, output module 116 may 
output the three closest matching faces to a display 118. 

Unfortunately, due to variation factors, conventional pattern recognition systems 
15 are often inaccurate. For example, variation factors such as scale, e.g., caused by a 
person being either closer or farther away, and rotation, e.g., caused by a person being 
slightly turned relative to camera 102, may cause the input image to not be matched to 
its corresponding stored pattern. Also, detecting relevant portions, e.g., image portion 
130, of certain types of images, such as gray-level images, may be incomplete or 
20 imprecise and may cause an image portion to have missing or extraneous features. 
Thus, conventional pattern recognition systems often produce erroneous recognition 
decisions. 
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In addition, conventional pattern recognition systems often use a single 
recognizer. A single recognizer makes a recognition decision based on a single 
recognition algorithm. Many efforts have been made to develop more sophisticated 
recognition algorithms to improve the performance of conventional single recognizer 
5 systems. However, such systems using sophisticated recognition algorithms are still 
prone to inaccurate results because of the above-mentioned variation factors. 

Stricter normalization of images during pre-processing has also been studied as 
a way to improve performance. For example, normalization of images may be used to 
minimize the effect of variation factors, e.g., such as scale and rotation. For example, 
10 images may be normalized to a fixed size. However, some variation factors, e.g., 

border shift, are difficult to detect. Also, even if a variation factor is detected, it may be 
difficult to accurately compensate for its effect to ensure an accurate recognition 
decision. 

Finally, some conventional pattern recognition systems may combine several 
15 recognizers using multiple recognition algorithms and/or modules thereof to enhance 
recognition accuracy. For example, combining several different recognizers with 
different matching modules may increase accuracy since the different recognizers may 
complement each other in a group decision. However, combining several recognizers is 
expensive to implement, and requires a large amount of memory. 
20 In addition, under certain circumstances, the combined recognizers may not 

complement each other, since they may disagree on a recognition decision. Combining 
several recognizers also requires a large amount of samples to "train" the recognizers to 
work together. Therefore, combining several recognizers often results in a complex 
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system, which may not perform well when only a relative small number of training 
examples are available. It is accordingly desirable to improve recognition accuracy 
using relatively simple and inexpensive systems. 

SUMMARY OF THE INVENTION 

5 In accordance with an aspect of the invention, a method of processing an input 

object for pattern recognition comprises: receiving an input object; segmenting a target 
object from the input object to form a segmented target object; performing at least one 
transform on the segmented target object to generate at least one transformed object; 
and outputting the segmented target object and the at least one transformed object to at 
10 least one pattern recognizer. 

In accordance with another aspect of the invention, a method of processing an 
input object for pattern recognition comprises: receiving an input object; detecting a 
target object within the input object; segmenting the target object from the input object 
to form a plurality of segmented target objects; and outputting the segmented target 
1 5 objects to at least one pattern recognizer. 

In accordance with another aspect of the invention, a method of aggregating a 
plurality of recognition results comprises: receiving a segmented target object and at 
least one transform of the segmented target object; performing at least one pattern 
recognition algorithm on the segmented target object and the at least one transform to 
20 generate a plurality of recognition results; aggregating the plurality of recognition results 
to determine a recognition decision; and outputting the recognition decision. 

In accordance with another aspect of the invention, a pattern recognition system 
comprises: an input to receive an input object; an object detector to detect a target 
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object within the input object; at least one transform module to perform at least one 
transform on the target object to form a plurality of transformed objects; at least one 
pattern recognizer for generating a plurality of recognition results based on the target 
image and the plurality of transformed objects; and a recognition result aggregator for 
5 determining a recognition decision based on the plurality of recognition results. 

In accordance with another aspect of the invention, pattern recognition system 
comprises: an input to receive an input object; an object detector to detect a target 
object and to form a plurality of segmented target objects; at least one pattern 
recognizer for generating a plurality of recognition results based on the plurality of 

1 0 segmented target objects; and a recognition result aggregator for determining a 
recognition decision based on the plurality of recognition results. 

In accordance with another aspect of the invention, a pattern recognition 
apparatus comprises: input means for receiving an input object; transform means for 
performing at least one transform on the input object to generate at least one 

15 transformed object; recognizer means for performing at least one pattern recognition 
algorithm on the input object and the at least one transformed object to generate a 
plurality of recognition results; aggregating means for aggregating the plurality of 
recognition results to determine a recognition decision; and decision output means for 
outputting the recognition decision. 

20 In accordance with another aspect of the invention, an apparatus for processing 

an input object for pattern recognition comprises: input means receiving an input object; 
segmentation means for segmenting a target object from the input object to form a 
segmented target object; transform means for performing at least one transform on the 
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segmented target object to generate at least one transformed object; and output means 
for outputting the segmented target object and the at least one transformed object to at 
least one pattern recognizer. 

In accordance with another aspect of the invention, an apparatus for processing 
5 an input object for pattern recognition comprises: input means for receiving an input 
object; segmentation segmenting at least one target object from the input object to form 
a plurality of segmented target objects; and output means for outputting the segmented 
target objects to at least one pattern recognizer. 

In accordance with another aspect of the invention, an apparatus for aggregating 
10 a plurality of recognition results comprises: means for receiving a segmented target 
object and at least one transform of the segmented target object; means for performing 
at least one pattern recognition algorithm on the segmented target object and the at 
least one transform of the segmented target object to generate a plurality of recognition 
results; means for aggregating the plurality of recognition results to determine a 
15 recognition decision; and means for outputting the recognition decision. 

Additional features and advantages of the invention will be set forth in part in the 
description which follows, and in part will be obvious from the description, or may be 
learned by practice of the invention. The features and advantages of the invention will 
be realized and attained by means of the elements and combinations particularly 
20 pointed out in the appended claims. 

It is to be understood that both the foregoing general description and the 
following detailed description are exemplary and explanatory only and are not restrictive 
of the invention, as claimed. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

The accompanying drawings, which are incorporated in and constitute a part of 
this specification, illustrate several embodiments of the invention and together with the 
description, serve to explain the principles of the invention. 
5 Fig. 1a shows a conventional pattern recognition system for recognizing faces. 

Fig. 1b illustrates a conventional face detector module detecting a person's face. 
Fig. 1c illustrates a conventional pre-processor module segmenting an image 
portion from an input image into an object segment. 

Fig. 1d illustrates a conventional feature extractor module extracting features 
10 from an object segment. 

Fig. 2 shows one embodiment of a pattern recognition system utilizing multiple 
transforms of an input object consistent with the principles of the present invention. 

Fig. 3 shows an alternative embodiment of a pattern recognition system utilizing 
multiple segmentation of an input object consistent with the principles of the present 
15 invention. 

Fig. 4 shows another alternative embodiment of a pattern recognition system 
utilizing multiple segments of an input object and multiple transforms consistent with the 
principles of the present invention. 

Fig. 5 shows a method for pattern recognition performed by the system of Fig. 2 
20 in accordance with the principles of the present invention. 

Fig. 6 shows a method for pattern recognition performed by the system of Fig. 3 
in accordance with the principles of the present invention. 

Fig. 7 shows a method for pattern recognition performed by the system of Fig. 4 
in accordance with the principles of the present invention. 
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DETAILED DESCRIPTION 

Reference will now be made in detail to exemplary embodiments of the invention, 
examples of which are illustrated in the accompanying drawings. Wherever possible, 
the same reference numbers will be used throughout the drawings to refer to the same 
5 or like parts. 

Fig. 2 shows one embodiment of a pattern recognition system utilizing multiple 
transforms of an input object consistent with the principles of the present invention. An 
input object is received. The input object may represent a wide variety of objects in 
which a pattern can be recognized, such as, objects representing faces, handwriting, 

10 biometrics, etc. In addition, the input object may comprise a plurality of image samples, 
e.g., samples of a person's face (not shown). A target object, such as an image of a 
person's face, is detected and segmented from the input object. Processing such as 
pose estimation is performed on the segmented target object. One or more transforms 
are then performed on the segmented target object to create a set of transformed 

1 5 objects. The segmented target object and the set of transformed objects are then 

processed, e.g., normalized and features extracted. The segmented target object and 
set of transformed objects are searched against a database of stored images to create 
a set of matching results. The matching results are aggregated and a matching 
decision is made. The matching decision is then provided, e.g., to a user. 

20 In particular, Fig. 2 shows: an input 200; an object detector 202; a pose estimator 

204; a transform module 206; normalization modules 208r208n; extraction modules 
210i-210„; matching modules 212 r 212 n ; an accumulator 214; and an output module 
216. Input 200 receives an input object. In one embodiment, input 200 may receive 
one or more input objects representing an image of a person's face. Input 200 may 
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receive the input object from an external device, such as a camera (not shown). 
Alternatively, the device generating an input object may be integrated within input 200. 
Input 200 then provides the input object to object detector 202. 

Object detector 202 detects and segments a target object within the input object. 
5 Object detector 202 may be implemented by any combination of hardware and software 
and using known algorithms to detect the target object. For example, in one 
embodiment, object detector 202 uses known algorithms to detect a person's face as 
the target object. After detecting a target object (i.e., a person's face), object detector 
202 segments the target object to form a segmented target object. Object detector 202 

10 then provides the segmented target object to pose estimator 204. 

Pose estimator 204 determines a spatial orientation of the target object within the 
segmented target object. For example, the target object may be tilted or rotated at a 
particular angle, e.g., due to person's position relative to a camera. Pose estimator 204 
may use known algorithms to search for features within the target object such as a 

15 person's eyes, mouth, or nose. Based on the location and orientation of these features, 
pose estimator 204 calculates the spatial orientation of the target object within the 
segmented target object. Pose estimator 204 may also use other algorithms for pose 
estimation in accordance with the principles of the present invention. 

Transform module 206 then performs a plurality of transforms on the segmented 

20 target object based on the spatial relationship of the target object to generate a set of 
transformed objects. By generating a set of transformed objects, transform module 206 
may provide a more comprehensive set of images to minimize variation factors, such as 
an offset or rotation in the target object within the segmented target object. For 
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example, transform module 206 may perform transforms such as: a rotation transform; 
an affine transform; a left boundary shift transform; a right boundary shift transform; a 
top boundary shift transform; and a bottom boundary shift transform. For example, 
transform module 206 may perform the rotation transform when pose estimator 204 
5 calculates that the target object is turned at a particular angle within the segmented 
target object. Transform module 206 may also shift one or more of the boundaries (i.e., 
left, right, top, or bottom) when pose estimator 204 calculates that the target object is 
offset within the segmented target object. However, transform module 206 may perform 
any number of a wide variety of transforms either individually or in combination, in 
10 accordance with the principles of the present invention. For example, transform module 
206 may also combine one or more transform algorithms to form a more sophisticated 
transform. 

Transform module 206 then provides the set of transformed objects to 
normalization modules 208r208 n . Transform module 206 may distribute the set of 

1 5 transformed objects in a wide variety of ways. For example, transform module 206 may 
provide one or more transformed objects to each normalization module. Alternatively, 
transform module 206 may provide the entire set of transformed objects to each 
normalization module. 

Normalization modules 208i-208 n standardize transformed objects provided from 

20 transform module 206 to form, e.g., one or more standardized candidate images. For 
example, normalization modules 208 r 208 n may each use known algorithms to 
standardize the transformed objects to a particular size, gray scale, contrast, or 
brightness. However, normalization modules 208r208 n may use any of a wide variety 
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of algorithms in accordance with the principles of the present invention. After the 
transformed objects are normalized, each of the normalization modules 208 1 -208 n 
provide their candidate images to their corresponding feature extraction modules 21 0V 
210 n . 

5 Feature extraction modules 21 0 r 210n extract relevant features from the one or 

more candidate images. For example, feature extraction modules 210i-210 n may use 
known algorithms to extract e.g., location of the eyes, distance between the eyes, 
location of the nose, and location of the mouth as relevant features from a candidate 
image. Each of feature extraction modules 210i-210 n may use an identical algorithm. 

10 Alternatively, feature extraction modules 210 r 210 n may use different algorithms, e.g., 
based upon the underlying transform used to form the one or more candidate images. 

Matching modules 212 1 -212 I1 search a database of stored images (not shown) 
based on the features extracted by feature extraction modules 21 0^21 0 n and match a 
candidate image to one of the stored images. Matching modules 212r212 n then 

15 generates one or more recognition results based on the search. The recognition results 
may include one or more of the stored images which match or substantially match the 
respective candidate image. Matching modules 212i-212 n may use known algorithms 
to match a respective candidate image to a stored image. However, any of a wide 
variety of algorithms may be used in accordance with the principles of the present 

20 invention. Matching modules 212 r 212 n may each use the same algorithm. 

Alternatively, matching modules 212 r 212 n may use different algorithms based upon 
the underlying transform used to form the candidate image. For example, matching 
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module 2^2^ may use a different algorithm from matching module 212 n . Matching 

modules 212 r 212 n then submit their respective recognition results to accumulator 214. 

Accumulator 214 receives the recognition results from each of the matching 

modules 212r212 n and makes one or more recognition decisions. In one embodiment, 

accumulator 214 uses the following equations: 

D(x) = j, if Gj(Sj(F 0 (x)), . . . , Sj(F N (x))) = arg max G k (S k (F 0 (x)), . . . , S k (F n (x))) where: 

1<k<K 

D denotes a decision rule function; 

G k (x) denotes a given aggregation rule function indicating a possibility that x 
belongs to a class k; 

S k (x) denotes the probability that x belongs to a class k; 

F n (x) denotes an extracted feature of an nth transformed image, where F 0 (x) is 
the extracted feature of the original input pattern x; 

x denotes a segmented pattern in a segmented input object with N number of 
transformations performed, where n is 1< n < N; 

j denotes a matching object; 

k denotes a pattern class in a concerned pattern with K sets of pattern classes in 
a class domain, Ci, C k , where k is 1< k < K; and 

Gi(Sk(F 0 (x)), S k (F N (x))) = max S k (F n (x)) from 0 to N. 

Alternatively, accumulator 214 may use the following equation for aggregation of 
recognition results: 

Gi(S k (F 0 (x)), S k (F N (x))) = (1/N) z S k (F n (x)) from n = 0 to N. 

However, accumulator 214 may use a wide variety of algorithms for aggregating 
and making one or more recognition decisions in accordance with the principles of the 

12 
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present invention. Accumulator 214 then provides the one or more recognition 
decisions to output module 216. 

Output module 216 provides the one or more recognition results, e.g., to a user 
(not shown). Output module 216 may provide the one or more recognition results in a 
5 wide variety of ways, such as a display, an electronic file on a computer readable 
medium, or a hardcopy printout. For example, in one embodiment, output module 216 
is a screen display. However, any output module, which allows a user to receive and 
interpret the one or more recognition decisions, is in accordance with the principles of 
the present invention. 

10 Fig. 3 shows an alternative embodiment of a pattern recognition system utilizing 

multiple segmentation of an input object consistent with the principles of the present 
invention. An input object is received. A target object is detected within the input 
object. The input object is processed and segmented multiple times into a plurality of 
segmented target objects, e.g., to minimize variation factors. Each of the segmented 

15 target objects is processed, e.g., pose estimated, normalized and features extracted. 
Each of the segmented target objects is then searched against a database of stored 
images to create a set of matching results. The matching results are aggregated and a 
matching decision is made. The matching decision is then provided, e.g., to a user. 
In particular, Fig. 3 shows: input 200; an object detector 202a; pose estimators 

20 204 r 204 n ; normalization modules 208 1 -208 n ; extraction modules 21 0 r 21 0 n ; matching 
modules 212 r 212 n ; accumulator 214; and output module 216. Input 200 receives an 
input object. Input 200 then provides the input object to object detector 202a. Object 
detector 202a detects a target object within the input object and provides a plurality of 
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segmented target objects. For example, object detector 202a may use one or more 
known algorithms to detect a person's face as the target object. For each different 
algorithm, object detector 202a may form a different segmented target object and, thus, 
create a set of varying segmented target objects, e.g., from a single input object. The 
set of segmented target objects from object detector 202a provides a more 
comprehensive set of images, e.g., to minimize variation factors and to increase 
recognition accuracy. Accordingly, a transform module, such as transform module 206 
shown with respect to Fig. 2, in conjunction with object detector 202a is not required. 
However, a transform module may be implemented in conjunction with object detector 
202a, in accordance with the principles of the present invention. Object detector 202a 
may also execute an algorithm multiple times to develop a plurality of segmented target 
objects. Object detector 202a then provides the plurality of segmented target objects to 
pose estimators 204r204n. Object detector 202a may distribute the plurality of 
segmented target objects in a wide variety of ways. For example, object detector 202a 
may provide one or more segmented target objects to each pose estimator. 
Alternatively, object detector 202a may provide the entire plurality of segmented target 
objects to each pose estimator. 

Pose estimators 204i-204 n are similar to pose estimator 204 described above 
with respect to Fig. 2. In particular, pose estimators 204 r 204 n determine a spatial 
orientation of the target object within their respective segmented target object. 
Normalization modules 208 r 208 n then standardize the segmented target object to form, 
e.g., a set of candidate images. Normalization modules 208 r 208 n then provide their 
respective candidate images to feature extraction modules 210 1 -210 n . 
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As described above with respect to Fig. 2, feature extraction modules 210i-210 n 
extract relevant features in a corresponding candidate image. Matching modules 21 2 r 
21 2 n then search a database of stored images (not shown) based on the extracted 
relevant features and match their respective candidate image to one of the stored 
5 images. Matching modules 21 2 r 21 2 n generate one or more recognition results based 
on the search. Matching modules 212i-212„ then submit their respective recognition 
results to accumulator 214. 

As described above with respect to Fig. 2, accumulator 214 receives the 
recognition results from each of the matching modules 212 r 212n and makes one or 

10 more recognition decisions. Output module 216 then provides the one or more 
recognition decisions. 

Fig. 4 shows another alternative embodiment of a pattern recognition system 
utilizing multiple segments of an input object and multiple transforms consistent with the 
principles of the present invention. An input object is received. A target object is 

1 5 detected within the input object. The input object is then processed and segmented 
multiple times into a plurality of segmented target objects, e.g., to account for variation 
factors and increase recognition accuracy. Each of the segmented target objects is 
processed, e.g., pose estimated. Multiple transforms are also performed on each of the 
segmented target objects to form a plurality of transformed objects. Each of the plurality 

20 of transformed objects is then searched against a database of stored images to create a 
set of matching results. The matching results are aggregated and a matching decision 
is made. The matching decision is then provided, e.g., to a user. 
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In particular, Fig. 4 shows: input 200; object detector 202a; pose estimators 204 r 
204 n ; transform modules 206i-206 n ; normalization modules 208 1 -208 2n ; extraction 
modules 210 r 210 2n ; matching modules 212 1 -212 2n ; accumulator 214; and output 
module 216. Input 200 receives an input object. Input 200 then provides the input 
object to object detector 202a. As described above with respect to Fig. 3, object 
detector 202a detects a target object within the input object and provides a plurality of 
segmented target objects. Object detector 202a then distributes the plurality of 
segmented target objects to pose estimators 204i-204 n . As described above, pose 
estimators 204 1 -204 n determine a spatial orientation of the target object within their 
respective segmented target object. As described above, transform modules 206i-206 n 
may then perform a plurality of transforms on their respective segmented target object 
based on the spatial relationship of the target object to generate a set of transformed 
objects. Transform modules 206 r 206 n then distribute the set of transformed objects to 
their respective normalization modules, i.e., normalization modules 208 1 -208 2n . For 
purposes of illustration, Fig. 4 shows transform modules 206i-206 n distributing 
transformed objects to two sets of subsequent modules. For example, as shown in Fig. 
4, transform module 206i may distribute transformed objects, e.g., to normalization 
modules 208i and 208 2 , feature extraction modules 21 d and 210 2) and matching 
modules 212i and 212 2 while transform module 206 n may distribute transformed 
objects, e.g., to normalization modules 208^ and 208 2n , feature extraction modules 
210 2rv i and 210 2n , and matching modules 210 2n -i and 210 2n . Alternatively, any number 
of sets of intervening modules for each transform module may be used in accordance 
with the principles of the present invention. 
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As described above, normalization modules 208i-208 2n standardize the 
transformed objects to form, e.g., a set of candidate images. After the transformed 
objects are normalized, each of the normalization modules 208i-208 2n provide the 
candidate images to their respective feature extraction modules 210 1 -210 2n - Feature 
5 extraction modules 21 0i-21 0 2n extract relevant features in their corresponding candidate 
image and provide them to corresponding matching modules 212i-212 2n . 

As described above, matching modules 212 r 212 2n then may search a database 
of stored images (not shown) based on the extracted features and match their 
respective candidate image to one of the stored images. Matching modules 212 1 -212 2n 

10 then generates one or more recognition results based on the search. Matching modules 
212-212 2n then submit their respective recognition results to accumulator 214. 
Accumulator 214 receives the recognition results from each of the matching modules 
212-212 2n and makes one or more recognition decisions. Output module 216 then 
provides the one or more recognition decisions, e.g., to a user (not shown). 

15 Fig. 5 shows a method for pattern recognition performed by the system of Fig. 2 

in accordance with the principles of the present invention. In particular, in step 500, 
input 200 receives an input object and provides the input object to object detector 202. 
In step 502, object detector 202 detects a target object within the input object and forms 
a segmented target object. In step 504, pose estimator 204 estimates a pose for the 

20 target object within the segmented target object. 

In step 506, transform module 206 performs one or more transforms on the 
segmented input object, e.g., to account for variation factors. For example, let: T 
represent a segmented target object an image; T(x, y) is a pixel of T at an xth horizontal 
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and yth vertical; and T is a transform of T. Accordingly, a transform for rotating a 
segmented target object T by 9 degrees, may be defined as T'(x',y') = T(x,y) where 

x' = cose x + sinG y; and 

y' = -sine x + cose y. 

As another example, an affine transform, where a11, a12, a21, a22, b1, and b2 
may be arbitrary constants, may be defined as T'(x', y') = T(x, y) where 
x' = a11 x + a12 y + b1; and 
y' = a21 x + a22 y + b2. 

Transform module 206 may also perform transforms using known algorithms for 
scale adjustment, contrast adjustment, or boundary shift. However, any of a wide 
variety of transforms are in accordance with the principles of the present invention. The 
number and type of transforms performed by transform module 206 may be pre- 
determined or may be varied in accordance with the principles of the present invention. 
In step 508, normalization modules 208 r 208 n normalize their respective transformed 
object to form a candidate image. In step 510, feature extraction modules 210r210 n 
extract relevant features in their respective candidate images. In step 512, matching 
modules 212r212 n search a database to find one or more matches for their respective 
candidate images. In step 514, accumulator 214 aggregates the recognition results 
from matching modules 212^212,! and makes one or more recognition decisions. In 
step 516, output module 216 provides the one or more recognition decisions. 

Fig. 6 shows a method for pattern recognition performed by the system of Fig. 3 
in accordance with the principles of the present invention. In particular, in step 600, 
input 200 receives an input object and provides the input object to object detector 202a. 
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In step 602, object detector 202a executes one or more algorithms to detect a target 
object and forms a plurality of segmented target objects, e.g., to account for variation 
factors. In step 604, pose estimator modules 204r204 n estimate a pose for the target 
object within their respective segmented target objects. 
5 In step 606, normalization modules 208i-208 n normalize their respective 

segmented target objects to form a candidate image. In step 608, feature extractor 
modules 210 r 210 n extract relevant features in their respective candidate image. In 
step 610, matching modules 212 1 -212 n search a database to find one or more matches 
for their respective candidate image. In step 612, accumulator 214 aggregates the 

10 recognition results from matching modules 212 1 -212 n and makes one or more 
recognition decisions. In step 614, output module 216 provides the one or more 
recognition decisions. 

Fig. 7 shows a method for pattern recognition performed by the system of Fig. 4 
in accordance with the principles of the present invention. In particular, in step 700, 

1 5 input 200 receives an input object and provides the input object to object detector 202a. 
In step 702, object detector 202a executes one or more algorithms to detect a target 
object and forms a plurality of segmented target objects. In step 704, pose estimator 
modules 204r204 n estimate a pose for the target object within their respective 
segmented target objects. In step 706, transform modules 206 1 -206 n perform one or 

20 more transforms on their respective segmented input objects to form a plurality of 
transformed objects. 

In step 708, normalization modules 208 1 -208 2n -i normalize their respective 
transformed objects, e.g., to form a candidate image. In step 710, feature extractor 
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modules 210i-210 2n -i extract relevant features in their respective candidate images. In 
step 712, matching modules 212i-212 2n -i search a database to find one or more 
matches for their respective candidate image. In step 714, accumulator 214 aggregates 
the recognition results from matching modules 212r212 2 n-i and makes one or more 
5 recognition decisions. In step 716, output module 216 provides the one or more 
recognition decisions. 

In accordance with the present invention, a pattern recognition system makes 
one or more recognition decisions utilizing a plurality of transforms, a plurality of 
segmented target objects, or a combination of both, e.g., to account for variation factors. 

10 Accordingly, methods, apparatus and systems implemented consistent with the present 
invention can make more accurate recognition decisions. For example, in a face 
recognition experiment comparing the accuracy of a conventional recognition system 
with a pattern recognition system and method implemented consistent with the present 
invention, the following results were obtained as shown in Table 1 below: 

15 Table 1 



Person 


Conventional Method 
Recognition Accuracy 


Inventive Method 
Recognition 
Accuracy 


1 


81.66% 


100.00% 


2 


95.48% 


100.00% 


3 


97.56% 


100.00% 


4 


89.85% 


90.90% 


5 


97.68% 


100.00% 


6 


88.92% 


92.85% 


7 


88.49% 


90.00% 


8 


98.76% 


100.00% 
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9 


on 

yu.*+o /o 


4 nn nno/ 
IUU.UU /o 


10 


Q7 99% 
y / .z^ /o 


a nn nno/ 
1UU.UU% 


11 


Q7 ftfto/ 

y / .oD /o 


h nn nno/ 
1 UU.UUto 


19 


QQ 710/ 

yy. / i /o 


h nn nno/ 
1UU.UU% 


13 


yo.HU /o 


^ nn nno/ 
1 UU.UUTo 


14 


Q9 R1% 
yz.u i /o 


1 nn nno/ 
I UU.UUto 


15 


yo.o i /o 


1 nn nno/ 
IUU.UU /o 


1fi 


no p-10/ 

yo.o i /o 


h nn nno/ 
IUU.UU % 


17 


Q"7 COO/ 
OY.OOTO 


100.00% 


18 


QQ P.Q0/ 

yy.oo /o 


a nn nno/ 
1 UU.UUTo 


1Q 


01 910/ 


-i nn nno/ 
IUU.UUto 


90. 


QO O>10/ 


4 nn nno/ 
IUU.UU To 


91 

Z 1 


yo.4o7o 


100.00% 


99 

ZZ 


QQ -ICO/ 

oy.TO /o 


4 nn nno/ 
1U0.U0% 


9^ 

ZO 


on 9/10/ 
0U.Z4 /o 


QQ QQO/ 
00.08 TO 


94 
z*+ 


Q7 n^o/ 
y f .uo /o 


-i nn nno/ 
1UU.00 To 


9^ 


QQ 910/ 

yy.z i /o 


4 nn nno/ 

100.00% 


9fi 

ZU 


Q7 P.9 

y / .oz 


4 nn nn 
1UU.U0 


97 
z f 


Q1 £7 

y i .0/ 


nn nn 

90.90 


Zo 


yo.zo 


100.00 


9Q 

zy 


no do 
98.96 


100.00 


^n 
ou 


yo.b4 


100.00 


01 


84.32 


100.00 


oz 


nn oo 
99.33 


100.00 


33 


QQ 90 


1 nn nn 

I UU.UU 


34 


95.58 


100.00 


35 


65.43 


77.77 
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36 


Q7 R9 


■i nn nn 
1 uu.uu 


37 


77 4ft 
/ i .to 


pn nn 
ou.uu 


38 


98.34 


1 nn nn 
I uu.uu 


39 


96 01 


•inn nn 
I uu.uu 


40 


UC7.0 / 


a nn nn 
I uu.uu 


41 


ftO R7 


o I .on 


42 


17 1 .JO 


on 
yu.yu 


43 




inn nn 
I uu.uu 


44 


98.25 


100.00 


45 


98.07 


100.00 


46 


99.38 


100.00 



Other embodiments of the invention will be apparent to those skilled in the art 
from consideration of the specification and practice of the invention disclosed herein. It 
is intended that the specification and examples be considered as exemplary only, with a 
true scope and spirit of the invention being indicated by the following claims. 
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